Goto

Collaborating Authors

 metal cylinder





LLMs Can Plan Only If We Tell Them

Sel, Bilgehan, Jia, Ruoxi, Jin, Ming

arXiv.org Artificial Intelligence

Large language models (LLMs) have demonstrated significant capabilities in natural language processing and reasoning, yet their effectiveness in autonomous planning has been under debate. While existing studies have utilized LLMs with external feedback mechanisms or in controlled environments for planning, these approaches often involve substantial computational and development resources due to the requirement for careful design and iterative backprompting. Moreover, even the most advanced LLMs like GPT-4 struggle to match human performance on standard planning benchmarks, such as the Blocksworld, without additional support. This paper investigates whether LLMs can independently generate long-horizon plans that rival human baselines. Our novel enhancements to Algorithm-of-Thoughts (AoT), which we dub AoT+, help achieve state-of-the-art results in planning benchmarks out-competing prior methods and human baselines all autonomously.


Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation

Zhang, Yan, Zhang, David W., Lacoste-Julien, Simon, Burghouts, Gertjan J., Snoek, Cees G. M.

arXiv.org Machine Learning

Most set prediction models in deep learning use set-equivariant operations, but they actually operate on multisets. We show that set-equivariant functions cannot represent certain functions on multisets, so we introduce the more appropriate notion of multiset-equivariance. We identify that the existing Deep Set Prediction Network (DSPN) can be multiset-equivariant without being hindered by set-equivariance and improve it with approximate implicit differentiation, allowing for better optimization while being faster and saving memory. In a range of toy experiments, we show that the perspective of multiset-equivariance is beneficial and that our changes to DSPN achieve better results in most cases. On CLEVR object property prediction, we substantially improve over the state-of-the-art Slot Attention from 8% to 77% in one of the strictest evaluation metrics because of the benefits made possible by implicit differentiation.


Object-Centric Learning with Slot Attention

Locatello, Francesco, Weissenborn, Dirk, Unterthiner, Thomas, Mahendran, Aravindh, Heigold, Georg, Uszkoreit, Jakob, Dosovitskiy, Alexey, Kipf, Thomas

arXiv.org Machine Learning

Learning object-centric representations of complex scenes is a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep learning approaches learn distributed representations that do not capture the compositional properties of natural scenes. In this paper, we present the Slot Attention module, an architectural component that interfaces with perceptual representations such as the output of a convolutional neural network and produces a set of task-dependent abstract representations which we call slots. These slots are exchangeable and can bind to any object in the input by specializing through a competitive procedure over multiple rounds of attention. We empirically demonstrate that Slot Attention can extract object-centric representations that enable generalization to unseen compositions when trained on unsupervised object discovery and supervised property prediction tasks.


Mysterious chunk of 'Chinese' space debris crash lands in Myanmar

Daily Mail - Science & tech

Is this part of a secret spy satellite? The metal cylinder is 4.5 metres long and just over a metre in diameter Another smaller piece of metal with Chinese writing fell nearby The incident comes shortly after Beijing launched a satellite into space It has not been confirmed whether the launch of the satellite and the metal objects found were related The metal cylinder is 4.5 metres long and just over a metre in diameter From exploding phones to dangerous hoverboards: Why are... Did fake news on Facebook swing the US election? The AI that learns by PLAYING: Google DeepMind is making... From exploding phones to dangerous hoverboards: Why are... Did fake news on Facebook swing the US election? The AI that learns by PLAYING: Google DeepMind is making... The barrel-shaped object crashed onto property owned by a jade mining company in Kachin State's Hpakant township on Thursday The bizarre events came the same day Chinese state media reported Beijing had recently launched a satellite into space.